Model Selection

ViT Backbone Network

# ViT Backbone Network

Checkpoint Aerial Mast3r

AerialMegaDepth is a deep learning model focused on aerial-ground reconstruction and view synthesis, capable of reconstructing 3D scenes from aerial images and generating new viewpoints.

Dpt Large Ade20k

A Transformer-based semantic segmentation model optimized for the ADE20K dataset

Image Segmentation

Vit Base Patch16 224.orig In21k

An image classification model based on Vision Transformer, pretrained on ImageNet-21k, suitable for feature extraction and fine-tuning

Image Classification

Samvit Base Patch16.sa1b

Segment-Anything Vision Transformer (SAM ViT) image feature model, which only includes feature extraction and fine-tuning capabilities, without a segmentation head.

Image Segmentation

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase